Communication Performance Issues for Two Cluster Computers

Authors

  • Francis Vaughan
  • Duncan A. Grove
  • Paul D. Coddington
Abstract

Clusters of commodity machines have become a popular way of building cheap, high-performance parallel computers. Many of these designs rely on standard Ethernet networks as the system interconnect. We have profiled the performance of some standard message-passing communication on commodity clusters using MPIBench, a tool for benchmarking the performance of MPI routines that uses a highly accurate, globally synchronised clock. The results suggest that existing methodologies of performance characterisation are inadequate. Tests were performed on two clusters: one with a conventional network architecture of switches connected via a high-bandwidth backbone, the other with a tetrahedral network topology that potentially provides lower contention and higher bandwidth. Where packet loss does not occur, performance in either system is good and degrades smoothly with load. However, packet loss is found to occur at any load, and the consequent invocation of the TCP/IP timeout and congestion control mechanisms affects performance far more than expected. The nature of many parallel programs causes overall performance to drop to the worst case rather than the average. The value of MPIBench in profiling communication in parallel systems is clearly demonstrated, particularly through its generation of probability distributions which allow detailed analyses of per-
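The abstract's point that synchronising parallel programs run at the worst case rather than the average can be illustrated with a small simulation. This is a minimal sketch, not the authors' method and not MPIBench itself: the function names and all numbers (base latency, jitter, loss probability, timeout penalty) are hypothetical values chosen only for illustration. Each step finishes when the slowest of `num_procs` messages arrives, so a rare timeout on any one process delays the whole step.

```python
import random
import statistics


def message_time(rng, base_us=100.0, jitter_us=10.0,
                 loss_prob=0.001, timeout_us=200_000.0):
    """Return one hypothetical message time in microseconds.

    All defaults are illustrative assumptions, not measured values.
    A lost packet is modelled as a single fixed timeout penalty.
    """
    t = base_us + rng.uniform(0.0, jitter_us)
    if rng.random() < loss_prob:
        t += timeout_us  # retransmission after a timeout
    return t


def simulate(num_procs=64, num_steps=2000, seed=1, **kwargs):
    """Return (mean per-message time, mean per-step time).

    A step models a synchronising exchange: it completes only when
    the slowest of the num_procs messages has arrived.
    """
    rng = random.Random(seed)
    all_times = []
    step_times = []
    for _ in range(num_steps):
        times = [message_time(rng, **kwargs) for _ in range(num_procs)]
        all_times.extend(times)
        step_times.append(max(times))  # the step waits for the slowest
    return statistics.mean(all_times), statistics.mean(step_times)


if __name__ == "__main__":
    avg_msg, avg_step = simulate()
    print(f"mean per-message time: {avg_msg:.1f} us")
    print(f"mean per-step time (slowest of 64): {avg_step:.1f} us")
```

Under these assumed parameters the per-step time is dominated by the tail of the distribution, which is why a full distribution of message times says more about application performance than an average does.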

Similar resources

Parallel Spatial Pyramid Match Kernel Algorithm for Object Recognition using a Cluster of Computers

This paper parallelizes the spatial pyramid match kernel (SPK) implementation. SPK is one of the most usable kernel methods, along with support vector machine classifier, with high accuracy in object recognition. MATLAB parallel computing toolbox has been used to parallelize SPK. In this implementation, MATLAB Message Passing Interface (MPI) functions and features included in the toolbox help u...

Communication Issues in Parallel Computing across ATM Networks

Cluster-based computing, which exploits the aggregate power of networked collections of computers, has drawn increasing attention in the parallel processing community. The success of cluster-based computing depends largely on the performance of the underlying network. Although Asynchronous Transfer Mode (ATM) was designed specifically to transport integrated multimedia data, the expected ubiquit...

SCI Multiprocessor PC Cluster in a WindowsNT Environment

Experiences and performance results of a multiprocessor cluster consisting of personal computers connected with PCI communication cards based on the SCI communication standard are presented. The SCI communication technology enables remote memory access to the physically distributed memory of the cluster from all over the network. The standard operating system WindowsNT extended with a small int...

Efficient Molecular Dynamics on a Network of Personal Computers

The Genoa Active Message Machine (GAMMA) is a high-performance Active Messages-like communication layer implemented at kernel level as an extension of the Linux Operating System, and made available to user applications through a programming library. On low-cost clusters of Personal Computers (PCs) connected by Fast Ethernet, GAMMA achieves much better communication performance compared to publi...


Publication date: 2003